Estimate of the correlation signal between cosmic rays and BL Lacs in future data 
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The existing correlation between BL Lacertae objects (BL Lacs) and cosmic-ray events observed by 
HiRes experiment provide sufficient information to formulate quantitatively the hypothesis about 
the flux of neutral cosmic-ray particles originated from BL Lacs. We determine the potential of 
future cosmic ray experiments to test this hypothesis by predicting the number of coincidences 
between arrival directions of cosmic rays and positions of BL Lacs on the celestial sphere, which 
should be observed in the future datasets. We find that the early Pierre Auger data will not have 
enough events to address this question. On the contrary, the final Pierre Auger data and the early 
Telescope Array data will be sufficient to fully test this hypothesis. If confirmed, it would imply the 
existence of highest- energy neutral particles coming from cosmological distances. 

PACS numbers: 98.70.Sa, 98.54.Cm 



I. INTRODUCTION 

The origin of particles with energies exceeding 10^^ eV 
(hereafter ultra-high-energy cosmic rays, or UHECRs) 
is one of the most intriguing questions of astroparti- 
cle physics (for a review, see e.g. Ref. [1]). Within 
conventional-physics explanations the existence of these 
particles is attributed to acceleration in yet unknown as- 
trophysical sites. Various candidates have been proposed 
(see e.g. Ref. [2], for a recent review). Poor (in astronom- 
ical standards) angular resolution of the cosmic-ray ex- 
periments makes it impossible to identify actual sources 
directly. Instead, one has to rely on statistical methods. 

A clean and indisputable way to perform a statistical 
analysis is to formulate a hypothesis about the sources 
before the data are released. This was not possible to do 
with the first data sets. At present, however, there ex- 
ists enough data to specify the class of likely sources. 
More important, new high-quality data are expected 
from cosmic-ray experiments in the very near future. Be- 
fore these data are released, a hypothesis to be tested 
should be formulated and the procedure to verify or fal- 
sify it should be defined. In particular, it is important 
to define quantitatively which results of a particular ex- 
periment would test the hypothesis at a given confidence 
level. In this note we determine such a quantitative pro- 
cedure. 

During the last ten years, many classes of astrophys- 
ical objects were tested for positional correlations with 
arrival directions of UHECRs (see Ref. [3] for a compar- 
ative study). Significant correlations were found [4-6] 
with different samples of BL Lacs from the Veron cat- 
alog [7]. The correlations with BL Lacs appear in new 
independent data sets as well. Most recently the correla- 
tions between bright confirmed BL Lacs and the arrival 
directions of events observed by the HiRes stereo exper- 
iment, were found [8] and confirmed by an independent 
method [9] . This case is particularly clean for it involves 



minimum of assumptions concerning the BL Lac sample 
and propagation of UHECR particles (the latter are as- 
sumed neutral and propagate along straight lines). Thus, 
it is most suitable for a rigorous test with the future data. 
The purpose of this paper is to formulate the testing pro- 
cedure before the data are released. 

We specify the hypothesis in Sec. II and formulate the 
procedure to test it in the following sections. In Sec. Ill, 
we describe how to estimate the experiment-independent 
quantitative parameter involved in the hypothesis on the 
basis of a single experiment (HiRes stereo in our case). 
In Sec. IV, we estimate the sensitivity of future experi- 
ments to the suggested hypothesis and determine which 
results would allow either to exclude or to confirm it at 
a given confidence level. We calculate the correspond- 
ing parameters for the final AGASA, new HiRes stereo, 
Pierre Auger and Telescope Array data sets. 



II. THE HYPOTHESIS TO BE TESTED 

The most precisely determined published arrival direc- 
tions were provided by HiRes in the stereo mode [10]. 
They recorded 271 events with energies E > 10^^ eV, 
which were found [8] to correlate with bright (visual mag- 
nitude TO < 18) confirmed BL Lacs from the 10th edition 
of the Veron catalog [7] — the sample previously iden- 
tified [5] as correlated with cosmic rays detected by the 
AGASA and Yakutsk experiments. With the angular 
resolution (in the stereo mode) of 0.6°, the strongest cor- 
relation signal was expected at 0.8° as determined by 
Monte-Carlo simulations in ReL [8]. There were 11 pairs 
"BL Lac - cosmic ray" found with the separation less 
than 0.8° while in average 3.5 pairs were expected for a 
randomly distributed sample. The angular separations 
observed are much smaller than a typical deflection of a 
charged particle of that energy in the Galactic magnetic 
field, so the correlation may only be explained by the 
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presence of a fraction of neutral primary particles [8] . 

One can summarize these results in the form of a hy- 
pothesis suitable for testing by other experiments. The 
hypothesis consists of three assumptions: 

i) Some fraction of cosmic-ray events with energy E > 
10^^ cV corresponds to neutral primary particles. 

ii) A part of these neutral particles, or their progen- 
itors, are emitted by the confirmed BL Lacs with 
magnitude m < 18 marked "BL" in the catalog [7]. 
There are 156 such objects. The neutral particles 
from BL Lacs compose the fraction r] of the ob- 
served UHECR flux. 

iii) The observed cosmic-ray fluxes of these 156 sources 
are roughly equal. 

Several remarks concerning the assumptions i)-iii) are 
in order. Firstly, we determine the value of 77 from the 
HiRes data. Secondly, the energy threshold of iJ = 
10^® eV adopted in i) was used in Ref. [8] because only 
the arrival directions of the events with E > 10^^ cV were 
published in Ref. [10]. Further studies [9] demonstrated 
the presence of the correlations in an independent (un- 
published) data set with E < 10^^ cV. This represents 
however a different claim which we do not discuss here; 
it can be analyzed in a similar way. When considering 
other experiments in what follows we assume the same 
cut on energy. This assumption may be important for the 
validity of the predictions, as the fraction of neutral par- 
ticles (which exhibit the neutral correlations) may vary 
with energy. If these variations are strong, the energy de- 
pendence of the acceptance of the experiments has to be 
taken into account even at the same energy cut. Also, it 
is necessary that absolute energy calibrations of different 
experiments match each other. Otherwise this should be 
taken into account when selecting events for testing the 
hypothesis (it is worth noting that, currently, the spectra 
obtained by different experiments do not match). 

Thirdly, the assumption ii) does not exclude other pos- 
sible sources of UHECRs and means only that BL Lacs 
emit some of the cosmic-ray particles. We will see shortly 
that according to HiRes data the fraction rj of neutral 
UHECRs attributed to BL Lacs is actually small, of or- 
der 2%. 

Finally, the assumption (iii) is a technical one; it may 

be replaced by any other particular conjecture on the rel- 
ative cosmic-ray brightness of BL Lacs in the sample. In 
fact, our analysis requires only that in the flux of cos- 
mic rays observed by a given experiment the fraction of 
events which are associated with BL Lacs (this fraction 
equals r] if the experiment sees the whole sky with the 
uniform acceptance) is proportional to the number of BL 
Lacs covered by the acceptance area. 

Our further logic is as follows. From each cosmic-ray 
experiment, one can put bounds on the fraction 77 un- 
der the assumptions (i)-(iii). Two experiments could, 
in principle, put the bounds which are, with some prob- 
ability p, inconsistent with each other. Leaving aside 



the possibility that one of the experiments is wrong, this 
would mean the rejection of the set of assumptions at the 
confidence level p. 

III. ESTIMATING r] FROM THE HIRES DATA 

Consider a given catalog of M astrophysical objects 
(candidate sources) with celestial coordinates {ai,6i}, 
i = 1, . . . , M, and a cosmic-ray experiment. Depending 
on the geographical location, field of view and observa- 
tion time this experiment has different exposures to par- 
ticular regions of the sky. This can be parametrized by 
the direction-dependent differential exposure dA{a,6), 
which we normalize to one, 

j dA{a,6) = 1. 

The integration here runs over the whole celestial sphere. 

Under certain assumptions [17], the number n of ob- 
served pairs "source cosmic ray" separated by angular 
distances smaller than some (small) angle is described 
by the Poisson distribution, 

P^[S,B]=^-^±P^.e-^'+^\ (1) 

where B denotes the average number of background 
events (that is, either originated from the sources not 
contained in the catalog or deflected) and S denotes 
the average number of events which originate from the 
sources in the catalog (we call S the signal). 

The background events are supposed to be isotropically 
distributed and detected according to the experiment's 
exposure, hence 

B{e)=NY, [ dA, (2) 

where the sum is taken over all sources in the catalog, fli 
is a circle of the radius 9 centered on i-ih source and N 
is the total number of cosmic-ray events in the data set. 
This can be written as 

B{9) = ^ NMF, (3) 
47r 

where 




is a geometrical "correction factor" which describes the 
distribution of the objects from a given catalog, with re- 
spect to the acceptance of the experiment: it equals one 
for the uniform distribution and is smaller (larger) than 
one if there is an underdensity (overdensity) of sources 
within the acceptance area. For small 6 it does not de- 
pend on 9. The exact values of F for particular experi- 
ments and a particular catalog of sources can be obtained 



3 



Experiment 


a 


Rof. 


F 


HiRes (stereo) 


0.6° 


[10, 11] 


1.38 


AGASA 


2.4° 


[12] 


1.37 


PAO (surface) 


1.4° 


[13, 14] 


0.53 


PAO (hybrid) 


0.6° 




0.48 


TA (surface) 


1.55° 


[15] 


1.41 


TA (hybrid) 


0.62° 




1.51 



TABLE I: Parameters of the cosmic-ray experiments, cr is 
the angular resolution of the experiment (the radius of the 
circle containing 68% of the reconstructed events from a point 
source). Reference is given for the value of a and for the 
details of direction-dependent exposure. See Eq. (4) for the 
definition of F. 



from Eq. (4) ; for the BL Lac sample under discussion and 
some ongoing and planned experiments they are listed in 
Table I. 

The signal S is different for different experiments. Let 
us express it in terms of the fraction r}. For S <^ N, the 
expression is linear[18]. The coefficient of proportionality 
depends on the acceptance of the experiment, the angular 
resolution (including the shape of the point-spread func- 
tion) and on the angle at which the signal is measured. 
The exact expression reads 



S{9) = g{e)FNf], 



(5) 



where the factor g{9) is the integrated point-spread func- 
tion of the experiment: this is the fraction of the events 
observed within the angle 9 from the position of the point 
source. For instance, for 9 equal to the angular resolu- 
tion (by definition, the radius of a circle containing 68% 
of the events) one has g{9Qs) = 0.68 [19]. 

In a given experiment one measures n and has to de- 
termine rj. The likelihood function for ry can be obtained 
from Eq. (1) by normalizing P„ to 1, £{r])dr] = 1. One 
has 



gFN 



T{n+1,B) 



{gFNri + Bf e 



-gFNri-B 



(6) 



For the original HiRes stereo sample [10], B — 3.5 and 
n = 11 at 61 = 0.8° [8]. At the 95% C.L. one has 0.015 < 
?7 < 0.035. Here and in the estimates of the next section 
we have set 9 ~ \/2cr, where a is the angular resolution of 
the relevant experiment (for the discussion of the optimal 
choice of 9 see Refs. [4, 8] and Sec. IV). 



IV. PREDICTIONS FOR FUTURE 
EXPERIMENTS 

With two different experiments X and Y, one ob- 
tains two likelihood functions, Lx{ii) and Cyir])- The 
two-dimensional probability distribution function corre- 
sponding to the observation of rji in the experiment X 
and r)2 in Y is simply 



Experiment 


9 = V2a 


N 


s 


B 


^"•95% 


iVo 


HiRes, original 


0.85° 


271 


7.46 


3.54 






HiRes (stereo) 


0.85° 


190 


5.23 


2.48 


1-17 


271 


AGASA 


3.39° 


1500 


43.4 


310 


308-417 


3870 


PAO (surface) 


1.98° 


500 


5.50 


13.5 


10-31 


3517 


PAO (surface) 


1.98° 


8000 


87.9 


216 


239-413 




PAO (hybrid) 


0.85° 


150 


1.86 


0.68 


0-7 


467 


PAO (hybrid) 


0.85° 


2000 


24.8 


9.10 


15-66 




TA (surface) 


2.19° 


500 


15.0 


44.4 


42-83 


1560 


TA (surface) 


2.19° 


8000 


239 


710 


785-1235 




TA (hybrid) 


0.88° 


150 


4.40 


2.28 


1-15 


277 


TA (hybrid) 


0.88° 


2000 


58.6 


30.3 


47-161 





TABLE 11: Predictions (95% CL) for the number Angg^ of 
pairs "cosmic ray - BL Lac" separated by less than 9 to be 
observed by future experiments for the catalog of 156 BL 
Lac's. The sets of UHEGR axe constrained by the cuts on 
energy {E > 10^^ eV) and on zenith angle {z < 60° for the 
Pierre Auger Observatory (PAO) and the Telescope Array 
(TA), z < 45° for AGASA). N is the total number of events 
in a datasct, S and B arc the corresponding rmmbors of av- 
erage signal and background coincidences. A^o is the minimal 
number of events which the experiment needs to accumulate 
in order to reach the current HiRes sensitivity. 



This formula provides a starting point for quantitative 
analysis of the compatibility of the two experiments. 
Changing variables to rji + 7^2 and rji — 772 and integrat- 
ing over r]i + 772 gives the probability distribution for the 
difference rji — rj2 which tells one how well the two experi- 
ments are compatible with each other within our hypoth- 
esis. This assumes that the number ny of pairs "source 
- cosmic ray" separated by the angle 6y observed in the 
experiment Y is known. 

Alternatively, assuming the likelihood function of 77 as 
follows from the experiment X, Eq. (6), one may pre- 
dict the number of pairs ny which should be observed 
by the experiment Y if the hypothesis is correct. The 
probability distribution for this number is 



P = 



i 

j drjCx 



{gyFyNr, + By)^- ^_,,p,n^_By 



nyl 



The 95% CL interval Ang^y^ = (711,7x2) for the number 
of pairs to be observed is defined by the conditions 

P [ni] = 0.025 , P [712] = 0.975 , 

m 

where P [m] = P„y 

nY=0 

(to be more precise, m is the largest integer for which 
P [7ii] < 0.025 and 712 is the smallest integer for which 
P[n2] > 0.975). 

In Table II we present the expected number of pairs 

"source - cosmic ray" for several future cosmic-ray data 
sets. The prediction is done for typical total numbers of 
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events expected to be observed. It is presented in the 
form of the 95% CL interval Ang5% — (721,712). If the 
number of actuaUy observed pairs falls within the indi- 
cated interval, it would be compatible with our hypoth- 
esis, otherwise it would exclude it with the probability 
95%. 

The ability to test the hypothesis is affected by many 
properties of the experiments. The most important ones 
are the angular resolution and the total number of events; 
however, favorable location in the Northern hemisphere 
(with more known BL Lacs in the field of view) also pro- 
vides a noticeable advantage. These parameters can be 
combined into a single quantity — signal-to- noise ratio Q 
— which characterizes the "sensitivity" of a given exper- 
iment to the BL Lac signal. It is defined as the ratio of 
the signal S to the typical fluctuation of the background. 

The absolute value of Q is related to the significance at 
which the existence/absence of the signal can be estab- 
lished. Namely, Q ^ 1 is required to test the pres- 
ence of the signal[20]. More importantly, different ex- 
periments can be compared in this way: the one with 
larger Q is more sensitive to the signal. Assuming that 
6 is chosen in the same way for all experiments (for in- 
stance, 6 = \pia as in the calculations above), the only 
experiment-dependent factor in Q is the first factor in 
Eq. (7). This factor determines the scaling of Q with 
the total number of events, the optimal angle B and the 
geometrical correction factor F. Thus, Eq. (7) allows to 
estimate the number of events which are necessary for the 
two experiments to have the same signal-to-noise ratio. 

Clearly, in order to test the hypothesis formulated on 
the basis of the HiRes data, another experiment has to 
have at least similar sensitivity. It is interesting to see 
how many events the experiments discussed above have 
to accumulate to reach the same signal-to-noise ratio as 
the HiRes experiment. These numbers are presented in 
Table II. One can see, for instance, that the Pierre Auger 
experiment will have to collect of order 500 events with 
E > 10^^ eV in the hybrid mode (and of order 3500 
events observed by the surface detector only) in order to 
fully probe the hypothesis formulated here. 

V. CONCLUSIONS 

To summarize, in this paper we have formulated, on 
the basis of the HiRes data, the quantitative hypothe- 



sis about the connection between UHECRs and BL Lacs 
(Sect. II, (i) (iii)) and defined the procedure to test this 
hypothesis in future experiments. The results are pre- 
sented in Table II. As follows from these results, the 
strong correlations observed in the HiRes data are con- 
sistent, at the 95% C.L., even with the absence of signal 
in some data sets. In particular, the final AGASA data 
set and the first release of the Pierre Auger data, both 
expected in near future, may not exhibit any correlations 
with this sample of BL Lacs and still be consistent with 
the correlation hypothesis formulated in Sec. II. On the 
contrary, the absence of correlations in the future Tele- 
scope Array and full Pierre Auger data may allow one to 
falsify this hypothesis. This difference is due to differ- 
ent angular resolutions, exposures and locations of these 
experiments. Several factors (e.g., the choice of the opti- 
mal angle Q with account of the shape of the individual 
point-spread function) may improve the sensitivity of the 
experiments compared to the indicative estimates given 
in Table II. 



In principle, other claims of correlations between 
UHECR and BL Lacs [4-6] can be tested quantitatively 
in a similar way. However, due to the cut adjustment 
(compensated by the penalty factor) to the best signal 
in these cases the simple analytic relation which in fact 
determines the flux of cosmic rays produced by BL Lacs, 
Eq. (6), is no longer valid; its analog may be calculated 
numerically. We leave this question for future work. 

The correlations of BL Lacs with the HiRes stereo 

data, if confirmed, would imply that a few-percent frac- 
tion of the highest-energy cosmic-ray fiux is composed 
of neutral particles coming from cosmological distances. 
This new UHECR puzzle would be as difficult to solve 
within conventional physics as the celebrated GZK cut- 
off [16] problem. 
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